Multi-modal Interface with Voice and Head Tracking for Multiple Home Appliances
نویسنده
چکیده
In this paper, we describe a multi-modal interface that allows use of voice and gesture commands for controlling distributed home appliances used by people with disabilities. The main objective of this study is combined with nonverbal and verbal interface for intuitive and efficient control that uses hands-free operation. The pointing gesture by facing as nonverbal interface represents selecting one of the home appliances. The voice commands as verbal interface represent button operation of the remote controller such as the power on/off, the channel select and the volume up/down. The prototype system can provide a hands-free remote controller for people with quadriplegia who do not have to send verbal commands for selecting home appliances.
منابع مشابه
The Ta2 Database - a Multi-modal Database from Home Entertainment
This paper presents a new database containing highdefinition audio and video recordings in a rather unconstrained video-conferencing-like environment. The database consists of recordings of people sitting around a table in two separate rooms communicating and playing online games with each other. Extensive annotation of head positions, voice activity and word transcription has been performed on...
متن کاملA Multi-Modal Database from Home Entertainment
This paper presents a new database containing highdefinition audio and video recordings in a rather unconstrained video-conferencing-like environment. The database consists of recordings of people sitting around a table in two separate rooms communicating and playing online games with each other. Extensive annotation of head positions, voice activity and word transcription has been performed on...
متن کاملFusion of Multi-modal Sensors in a Voxel Occupancy Grid for Tracking and Behaviour Analysis
In this paper, we present a multi-modal fusion scheme for tracking and behavior analysis in Smart Home environments. This is applied to tracking multiple people and detecting their behavior. To this end, information from multiple heterogeneous sensors (visual color sensor, thermal sensor, infrared sensor and photonic mixer devices) is combined in a common 3D voxel occupancy grid. Graph cuts are...
متن کاملA Multi-Modal System Intellectual Computer AssistaNt
The paper describes a multi-modal system ICANDO (an Intellectual Computer AssistaNt for Disabled Operators) developed by Speech Informatics Group of SPIIRAS and intended for assistance to the persons without hands or with disabilities of their hands or arms in human-computer interaction. This system combines the modules for automatic speech recognition and head tracking in one multi-modal syste...
متن کاملMulti-camera Tracking and Activity Recognition
This document describes the progress on the MUCATAR (MUltiple CAmera Tracking and Activity Recognition) IM2 White Paper Project during its second year. Building on the first year achievments on single-object tracking, the research during the second year moved into two main directions: 1) the investigation of new sampling strategies to improve tracking with particle filters, both for single and ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2001